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CLAIMS : 

1. A method of summarizing digital audio data comprising 
the steps of: 

analyzing the audio data to identify a representation 
of the audio data having at least one calculated feature 
characteristic of the audio data; 

classifying the audio data on the basis of the 
representation into a category selected from at least two 
categories; and 

generating an acoustic signal representative of a 
summarization of the digital audio data, wherein the 
summarization is dependent on the selected category. 

2. A method as claimed in claim 1, wherein the analyzing 
step further comprises segmenting audio data into 
segment frames, and overlapping the frames, 

3. A method as claimed in claim 2, wherein the classifying 
step further comprises classifying the frames into a 
category by collecting training data from each frame and 
determining classification parameters by using a training 
calculation. 

4. A method as claimed in any preceding claim, wherein the 
calculated feature comprises perceptual and subjective 
features related to music content. 

6. A method as claimed in claim 3, wherein the training 
calculation comprises a statistical learning algorithm 




wo 2004/049188 



PCT/SG2002/000279 



18 



wherein the statistical learning algorithm Is Hidden 
Markov Model, Neural Network, or Support Vector 
Machine. 

6. A method as claimed in any preceding claim, wherein the 
type of acoustic signal Is music. 

7. A method as claimed In any preceding claim, wherein the 
type of acoustic signal is vocal music or pure music. 

8. A method as claimed in any preceding claim, wherein the 
calculated feature is amplitude envelope, power spectrum 
or mei-frequency cepstral coefficients. 

9. A method as claimed in any preceding claim, wherein the 
summarization is generated in terms of clustered results 
and heuristic rules related to pure or vocal music. 

10. A method as claimed in any preceding claim, wherein the 
calculated feature relates to pure or vocal, music content 
and Is linear prediction coefficients, zero crossing rates, 
or mei*frequency cepstral coefficients- 

.11. An apparatus for summarizing digital audio data 
comprising: 

a feature extractor for receiving audio data and 
analyzing the audio data to identify a representation of 
the audio data having at least one calculated feature 
characteristic of the audio data; 

a classifier in communication with the feature 
extractor for classifying the audio data on the basis of the 
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representation received from the feature extractor into a 
category selected from at least two categories; and 

a summarizer in communication with the classifier 
for generating an acoustic signal representative of a 
summarization of the digital audio data, wherein the 
summarization is dependent on the category selected by 
the classifier. 

12. An apparatus as claimed in claim 11, further comprising a 
segmentor in communication with the feature extractor for 
receiving an audio file and segmenting audio data into 
segment frames, and overlapping the frames for the 
feature extractor. 

13. An apparatus as claimed in claim 12, further comprising a 
classification parameter generator in communication with 
the classifier, wherein the classifier classifies each of the 
frames Into a category by collecting training data from 
each frame and determining classification parameters by 
using a training calculation in the classification parameter 
generator. 

14. An apparatus as claimed In any of claims 11-13, wherein 
the calculated feature comprises perceptual and 
subjective features related to music content. 

15. An apparatus as claimed In any of claims 11-14, wherein 
the training calculation comprises a statistical learning 
algorithm wherein the statistical learning algorithm is 
Hidden Markov Model, Neural Network, or Support Vector 
Machine. 
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16. An apparatus as claimed in any of claims 1 1-15, wherein 
the acoustic signal is music. 

17* An apparatus as claimed in any of claims 11-16, wherein 
the acoustic signal is vocal music or pure music. 

18. An apparatus as claimed in any of claims 11-17, wherein 
the calculated feature is amplitude envelope, power 
spectrum or mel-frequency cepstral coefficients. 

19- An apparatus as claimed in any of claims 11-18, wherein 
the summarizer generates the summarization in terms of 
clustered results and heuristic rules related to pure or 
vocal music. 

20. An apparatus as claimed in any of claims 11-19, wherein 
the calculated feature relates to pure or vocal music 
content and is linear prediction coefficients, zero crossing 
rates, or mel-frequenay. 

21. A computer program product for summarizing digital audio 
data comprising a computer usable medium having 
computer readable program code means embodied In said 
medium for causing the summarizing of digital audio data, 
said computer program product comprising: 

a computer readable program code means for 
analyzing the audio data to identify a representation of 
the audio data having at least one calculated feature 
characteristic of the audio data; 
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a computer readable program code for classifymg 
the audio data on the basis of the representation into a 
category selected from at least two categories; and 

a computer readable program code for generating an 
acoustic signal representative of a summarization of the 
digital audio data, wherein the summarization is 
dependent on the selected category. 

22. A computer program product as claimed in claim 21, 
wherein analyzing further comprises segmenting audio 

: data Into segment frames, and overlapping the frames. 

23. A computer program product as claimed In claim 22, 
wherein classifying further comprises classifying the 
frames into a category by collecting training data from 
each frame and determining classification parameters by 
using a training calculation. 

24. A computer program product as claimed in any of claims 
21-23, wherein the calculated feature comprises 
perceptual and subjective features related to music 
content. 

25 . A computer program product as claimed in any of claims 21-24. 
wherein the training calculation comprises a statistical 
learning algorithm wherein the statistical learning 
algorithm is Hidden Markov Model, Neural Network, or 
Support Vector Machine. 

26- A computer program product as claimed in any of claims 21-25, 
wherein the acoustic signal Is music. 
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27. A computer program product as claimed in any of claims 21-26, 

wherein the type of acoustic signal is vocal music or pure 
music. 

2€f . . A computer program product as claimed in any of claims 21-27, 

wherein the calculated feature is amplitude envelope, 
power spectrum or mel-frequency cepstral coefficients. 

29. A computer program product as claimed in any of claims 21-28, 
wherein the summarization is generated in terms of 
clustered results and heuristic rules related to pure or 
vocal music. 

30. A computer program product as claimed in any of claims 21-29. 
wherein the calculated feature relates to pure or vocal 
music content and is linear prediction coefficients, zero 
crossing rates, or-mel-frequency. 



INTERNATIONAL PKI^BiNARY EXAMINATION REPORT 



International application No. 
PCT/SG2002/000279 



I. Basis of the report 



X 



1 . With regard to the elements of the international application!* ' 
. I ] the international application as originally filed. 

the description, pages 1-16, as originally fUed, 

pages , filed with the demand, 

pages , received on with the letter of 

the claims, pages , as originally filed, 

pages , as amended (together with any statement) under Article 19, 

pages , filed widi the demand, 

pages 17-22, received on 7 December 2004 with the letter of 7 December 2004 

X| the drawings, pages 1/6-6/6, as originally filed, 

pages , filed with the demand, 

pages , received on with the letter of 
I I the sequence listing part of the descrq>tion: 

pages , as originally filed 
pages , filed with the demand 
pages , received on with the letter of 

With regard to the language, all the elements marked above were available or furnished to this Authority in the language m 
which the international application was filed, unless otherwise indicated under ftds item. 
These elements were available or furnished to this Authority in the foUowing language which is: 
I I ^® language of a translation furnished for the purposes of intemational search (under Rule 23.1(b)). 

I I t*ie language ofpublicationofthe international application (under Rule 48.3(b)). 

Q the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 
and/or 55.3). 

With regard to any nucleotide and/or amino acid sequence disclosed in the international appKcation, the intemational 
prelmnnary examination was carried out on the basis of the sequence listing: 

I [ contained in the intemational application in written form. 

I I filed together with the intemational application in conq)uter readable fonn, 
I I furnished subsequentiy to this Authority in written fomi. 
I I furnished subsequently to this Authority in con^uter readable form. 

I I 7^® statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
intemational application as filed has been furnished. 

[jj The statement that the infonnation recorded in computer readable fomi is identical to the written sequence listing has 
been furnished 

4. The amendments have resulted in the cancellation of: 
I I the description, pages 
I I the claims, Nos. 
I I the drawings, sheets/fig. 

5. Q This report has been established as if (some of) the amendments had not been made, since they have been considered to 
go beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)),** 



Replacement sheets which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to in this 
r^rtas onguially filed and of^ not annexed to this report since they do not coiUain amendment (Rules 70, 1 6 and 70 J 7), 
Any replacement sheet containing such amendments must be referred to under item I and annexed to this report 
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V. Reasoned statement under Article 3S(2) with regard to novelty, inventive step or industrial applicability; citations 
and explanations siqpporting such statement 



1. Statement 

Novelty (N) 



Ibiventive step (IS) 



Claims 1-30 
Claims 

Claims 1-30 
Claims 

Industrial applicability (lA) Claims 1-30 

Claims 



YES 

NO 

YES 

NO 

YES 

NO 



2. ■ Citations and ei^lanaitions (Rule 70.7) 

The foUowing documents identified in the Ihtemational Search Report have been considered for the purposes of 
mis report: ^ ^ 

US 6225546. 
Novelty Or> Claims \.^n 

None of the cited documents disclose all of the features of each of the independent claims. Therefore all of the 
clanns are novel. 



Inventive Step riS'> T-^O 

The claimed invention is not obvious in the light of any of the cited documents nor is it disclosed in any obvious 
cgmbmation of them. It is also considered that it would not be obvious to a person skilled in the art in the Ught of 
common general knowledge either by itself or in combination with any of these documents 
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